Search CORE

96 research outputs found

STEFANN: Scene Text Editor using Font Adaptive Neural Network

Author: Bhattacharya Saumik
Ghosh Subhankar
Pal Umapada
Roy Prasun
Publication venue
Publication date: 25/04/2020
Field of study

Textual information in a captured scene plays an important role in scene interpretation and decision making. Though there exist methods that can successfully detect and interpret complex text regions present in a scene, to the best of our knowledge, there is no significant prior work that aims to modify the textual information in an image. The ability to edit text directly on images has several advantages including error correction, text restoration and image reusability. In this paper, we propose a method to modify text in an image at character-level. We approach the problem in two stages. At first, the unobserved character (target) is generated from an observed character (source) being modified. We propose two different neural network architectures - (a) FANnet to achieve structural consistency with source font and (b) Colornet to preserve source color. Next, we replace the source character with the generated character maintaining both geometric and visual consistency with neighboring characters. Our method works as a unified platform for modifying text in images. We present the effectiveness of our method on COCO-Text and ICDAR datasets both qualitatively and quantitatively.Comment: Accepted in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 202

arXiv.org e-Print Archive

Crossref

A CNN Based Framework for Unistroke Numeral Recognition in Air-Writing

Author: Ghosh Subhankar
Pal Umapada
Roy Prasun
Publication venue
Publication date: 14/03/2023
Field of study

Air-writing refers to virtually writing linguistic characters through hand gestures in three-dimensional space with six degrees of freedom. This paper proposes a generic video camera-aided convolutional neural network (CNN) based air-writing framework. Gestures are performed using a marker of fixed color in front of a generic video camera, followed by color-based segmentation to identify the marker and track the trajectory of the marker tip. A pre-trained CNN is then used to classify the gesture. The recognition accuracy is further improved using transfer learning with the newly acquired data. The performance of the system varies significantly on the illumination condition due to color-based segmentation. In a less fluctuating illumination condition, the system is able to recognize isolated unistroke numerals of multiple languages. The proposed framework has achieved 97.7%, 95.4% and 93.7% recognition rates in person independent evaluations on English, Bengali and Devanagari numerals, respectively.Comment: Accepted in The International Conference on Frontiers of Handwriting Recognition (ICFHR) 201

arXiv.org e-Print Archive

From Particle Mechanics to Pixel Dynamics: Utilizing Stochastic Resonance Principle for Biomedical Image Enhancement

Author: Kumar Roy Prasun
Subramanyam Rallabandi V.P.
Publication venue: 'IntechOpen'
Publication date: 10/10/2011
Field of study

IntechOpen

Validating a novel angular power spectrum estimator using simulated low frequency radio-interferometric data

Author: Ali Sk. Saiyad
Bharadwaj Somnath
Choudhuri Samir
Dutta Prasun
Ghosh Abhik
Roy Nirupam
Publication venue: 'Elsevier BV'
Publication date: 27/06/2017
Field of study

The "Tapered Gridded Estimator" (TGE) is a novel way to directly estimate the angular power spectrum from radio-interferometric visibility data that reduces the computation by efficiently gridding the data, consistently removes the noise bias, and suppresses the foreground contamination to a large extent by tapering the primary beam response through an appropriate convolution in the visibility domain. Here we demonstrate the effectiveness of TGE in recovering the diffuse emission power spectrum through numerical simulations. We present details of the simulation used to generate low frequency visibility data for sky model with extragalactic compact radio sources and diffuse Galactic synchrotron emission. We then use different imaging strategies to identify the most effective option of point source subtraction and to study the underlying diffuse emission. Finally, we apply TGE to the residual data to measure the angular power spectrum, and assess the impact of incomplete point source subtraction in recovering the input power spectrum

C_{\ell}

of the synchrotron emission. This estimator is found to successfully recovers the

C_{\ell}

of input model from the residual visibility data. These results are relevant for measuring the diffuse emission like the Galactic synchrotron emission. It is also an important step towards characterizing and removing both diffuse and compact foreground emission in order to detect the redshifted

21\, {\rm cm}

signal from the Epoch of Reionization.Comment: 18 pages, 1 table, 9 figures, Accepted for publication in New Astronom

arXiv.org e-Print Archive

Open Access Repository of IISc Research Publications

Ultrasonographic evaluation of anatomical variations, of the first dorsal compartment of wrist, and its correlation with de Quervain’s disease recurrence in Indian population: a hospital based longitudinal study

Author: Mandal Prasun
Mukherjee Soutrik
Roy Amlan J.
Publication venue: Medip Academy
Publication date: 11/07/2023
Field of study

Background: de Quervain’s disease is a commonly encountered debilitating condition mostly treated conservatively. Cadaveric studies have revealed anatomical variations in the first dorsal compartment of the wrist; however, a causal association between the variants and recurrence of the disease has not been established. An ultrasonographic assessment of the first dorsal compartment of the wrist was done, to correlate the response to treatment, of the different anatomical variants. Methods: 106 wrists were included in the study after clinically confirming de Quervain’s disease. All were injected with 20mg of methylprednisolone acetate in the first dorsal compartment of the wrist and were followed up. Patients who were pain free for at least six months, as well as patients who did not respond or had a recurrence of disease within eight to twelve weeks, were all assessed with ultrasound. Results: 57.54% of the wrists were cured with one injection most of which revealed a tendon arrangement of one abductor pollicis longus (APL) and one extensor pollicis brevis (EPB) and two APL and one EPB without any septation. 32.07% of the wrists did not respond to this treatment and most of them comprised of septations and aberrant compartments with presence of supernumerary tendons. Conclusions: Ultrasonographic assessment of the first dorsal compartment of wrist, of all patients suffering from de Quervain’s tenosynovitis must be done to predict the outcome of conservative therapy and customize an optimal treatment modality for the individual anatomical variant

International Journal of Research in Orthopaedics

Quantitative Risk Assessment in Titanium Sponge Plant

Author: Bhatt Arti
Rajagopal Chitra
Roy Prasun K.
Publication venue: 'Defence Scientific Information and Documentation Centre'
Publication date: 01/10/2004
Field of study

This pap& presents the quantitative risk assessment for the storage of titanium tetrachloride (TiCl,).It is the major reactant used for the production of titanium in the titanium spongeplant. Titanium tetrachloride readily reacts with moisture, leading to the release of toxic hydrogen chloride (HCI).F ire explosive and toxicity index analysis, and hazard and operability(HAZOP) studies for the entire titanium sponge plant were carried out. Based on these studies, the TiCl, storage section was found to be one of the most hazardous sections in the titaniumsponge plant. Fault tree analysis technique has been used to identify the basic events responsible for the top event occurrence, ie, release of HCl due to the hydrolysis of TiCl, upon contactwith moisture in the environment during spillagelleakage of TiCl, from the storage tanks and to calculate its probability. Consequence analysis of the probable scenarios has been carriedout. The risk has been estimated in terms of fatality.and injuries. Based on these results, basic input in the form of recommendations for possible changes in the design and operation of thetitanium sponge plant have been made for the risk management

Directory of Open Access Journals

Defence Science Journal

Effects of Degradations on Deep Neural Network Architectures

Author: Bhattacharya Saumik
Ghosh Subhankar
Pal Umapada
Roy Prasun
Publication venue
Publication date: 26/06/2019
Field of study

Recently, image classification methods based on capsules (groups of neurons) and a novel dynamic routing protocol are proposed. The methods show promising performances than the state-of-the-art CNN-based models in some of the existing datasets. However, the behavior of capsule-based models and CNN-based models are largely unknown in presence of noise. So it is important to study the performance of these models under various noises. In this paper, we demonstrate the effect of image degradations on deep neural network architectures for image classification task. We select six widely used CNN architectures to analyse their performances for image classification task on datasets of various distortions. Our work has three main contributions: 1) we observe the effects of degradations on different CNN models; 2) accordingly, we propose a network setup that can enhance the robustness of any CNN architecture for certain degradations, and 3) we propose a new capsule network that achieves high recognition accuracy. To the best of our knowledge, this is the first study on the performance of CapsuleNet (CapsNet) and other state-of-the-art CNN architectures under different types of image degradations. Also, our datasets and source code are available publicly to the researchers.Comment: Journa

arXiv.org e-Print Archive